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ABSTRACT 

One of the most promising methods of discovering nearby, low-mass planets in the habitable zones 
of stars is the precision radial velocity technique. However, there are many challenges that must 
be overcome to efficiently detect low-amplitude Doppler signals. This is both due to the required 
instrumental sensitivity and the limited amount of observing time. In this paper, we examine statistical 
and instrumental effects on precision radial velocity detection of extrasolar planets, an approach by 
which we maximize the planet yield in a fixed amount of observing time available on a given telescope. 
From this perspective, we show that G and K dwarfs observed at 400-600 nm are the best targets for 
surveys complete down to a given planet mass and out to a specified orbital period. Overall we find 
that M dwarfs observed at 700-800 nm are the best targets for habitable-zone planets, particularly 
when including the effects of systematic noise floors. Also, we give quantitative specifications of the 
instrumental stability necessary to achieve the required velocity precision. 
Subject headings: Extrasolar Planets, Astronomical Instrumentation, Astronomical Techniques 



1. INTRODUCTION 

Over 400 extrasolar planets have been discovered by 
surveys using the pr ecision radial velocity (PRV) method 
(|Wright et al.ll201lD . The first discoveries of planets us- 
ing the PRV method were mainly massive planets in 
close-in orbits, known as hot Jupiters, revealing a sur- 
prising diversi ty in pla. i ietary systems (jMavor fc Queloj 
[l995; Butler fc Marcvl 119961 ). More recently, techni- 
cal advances have resulted in discoveries of planets 
with masses interme diate to terrestrial and gas giants, 
the "Super Earths" (iMcArthur et al.ll2004HRivera et al 



2005| lUdrv et al.l 120071 : iHoward et al.l 120111: iPepe et al 
201 Ih . Future instrumentation promises the sensitivity 



needed to detect Earth analogs around Sun-like stars, but 
there are substantial challenges that must be overcome 
to attain this level of precision. 

There are three kinds of limits to how well one can re- 
cover a radial velocity signal from a target star. First, 
there are statistical limits that come from the signal- 
to-noise ratio and the depth, density, and shape of the 
spectral lines. These effects may be further separated as 
having components that come from the physical proper- 
ties of the star, such as the stellar luminosity and dis- 
tance to the star, which affects the signal-to-noise ratio 
of the spectrum; the chemical composition, which sets 
the number and depth of spectral lines; and the rota- 
tion rate, which affects the broadening of the spectral 
lines. Additionally, there are effects that come from ob- 
serving, such as the exposure time, telescope diameter, 
system throughput (sky to detector), and detector noise, 
which affect the overall signal-to-noise ratio; the reso- 
lution of the spectrometer and the sampling of the line 
spread function, which affect the width and clarity of 
the observed lines; and the decision of which wavelength 
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range to observe. This latter point is notable as instru- 
ments are optimized to observe in a particular region of 
the electromagnetic spectrum, and the line density and 
quality can vary significantly between various types of 
stars in different regions of their spectra. Furthermore, 
for ground-based studies, there are regions where telluric 
absorption makes observing impossible. 

Even if all the negative effects above are minimized, 
there is still an important second class of problems which 
will hurt the velocity precision: the inability to properly 
control and/or characterize the changes in the instru- 
mental profile of the spectrometer from night to night. 
Guiding errors, such as a star moving on a slit, can be a 
major component of this, as displacements in the center 
of light lead to skews in the instrumental profile. Ad- 
ditionally, small changes in the ambient pressure, tem- 
perature, etc. lead to changes in the optical path, which 
are degenerate with velocity shifts. This was realized as 
being a f undamental li mit to velocity precision over 45 
years ago (|GrifEnlll967[ ) , and successful attempts to miti- 
gate this require control of the environmental conditions 
and also active modeling of the fiuctuating instrumental 
profile. 

Finally, even in the limit of a perfect instrument and 
infinite observing time, there are wavelength-dependent 
stellar effects that can mimic the signal of a planet or- 
biting a star, such as starspots, stellar activity (jitter), 
and quasi-periodic oscillations of the stellar photosphere. 
These deleterious effects may be mitiga ted somewhat by 
clever observing strategies or modeling ()Dumusaue et akl 
l2011bl lal). and the most successful programs are some- 
times limited by this class of pr oblems. 

Ma ny stu d ies, n otably [ Bouchv et al.l (|200H ): 
Rein ers et al.l ([20Tol ): iWang fc Gc (201l|liave con- 
sidered the best ways to detect planets around stars. 
However, we take a somewhat different approach, where 
our end goal is to optimize a radial velocity survey 
for sensitivity and detection efficiency. Optimizing an 
observing plan to detect a planet around a particular 
star has a different set of requirements than a plan that 
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seeks to discover the maximum number of planets, or 
discover a particular class of planets. When considering 
how to optimize a survey, all the effects mentioned in 
the previous paragraphs must be considered together 
with the expected velocity signals caused by planets. 
Less massive stars are dimmer, but have higher velocity 
signals from similarly-sized planets. Additionally, there 
are more nearby low-mass stars than high-mass stars, 
given the present-day stellar mass function. In order to 
select targets for a survey, a balance must be struck, 
which will depend critically on the wavelengths of 
observation. As we will show, it will also depend on the 
particular kind of survey under consideration. 

In this work, we will characterize the statistical limita- 
tions in radial velocity observations under the consider- 
ation of finite observing time. We will consider the most 
productive way to choose targets for a survey in order 
to maximize the recovery of planets per unit of observing 
time. 

We arrive at our conclusions by the following chain of 
reasoning: in Section 2, we examine the precision achiev- 
able on different types of stars at a fixed distance and 
with fixed observing time, assuming a perfect instrument. 
In Section 3, we then relax the assumption of fixed ob- 
serving time and discuss the time necessary to detect 
different types of planets. We remove the assumption of 
a fixed stellar distance by considering nearby stars and 
the effects of the present-day mass function, and discuss 
the optimal way to select targets as a function of observ- 
ing wavelengths. In Section 4, we discuss the effects of 
instrumental noise floors, how they arise, and how they 
affect the results in the previous sections. 

2. EFFECTS OF STELLAR TYPE 

Previous studies have sought to characterize the fun- 
damental radial- velocity quality of different stellar spec- 
tra, considering the number density and depth of spec- 
tral lines. Additionally, these studies have examined the 
fundamental limit ations set by the sign al-to-noise of an 
observation (e.g. iBouchv et al.l ()2001| )). These effects 
are not completely separate; they depend on the spec- 
trum's shape and its specific intensity, modulated by the 
effect of stellar size and distance. If velocity precision 
is the ultimate goal, this requires additional considera- 
tion of at least one other thing, that being the observing 
wavelengths. 

We choose to examine the effects of spectrum and 
wavelength range on velocity p recision. We begin by us- 
ing the latest spectral m odels (lAllard et al.l 120 111) based 
on the PHOENIX code (jHauschildt et al.Ml999r in the 
Toff range of 2600 to 6200K, [Fe/H]=0, from 0.3 to 2.5 
fim. For stars of Toff = 2600 - 3400 K, we choose models 
where log 5 = 5.0; for 3600-5800 K we use log g = 4.5, and 
for 6000-6200 K we use log g = 4.0. In order to simulate 
the effects of instrument resolution, we convolve the mod- 
els with a generic Gaussian instrumental profile equiva- 
lent to a resolution of A/ A A = 75000 and a sampling of 
3.0 pixels per resolution element. Next, we rotationally 
broaden the spectra of stars with TeS > 3600K to match 
a disk-integrated vs'mi ~ 2.0 km/s. For c o oler s tars, 
we use V sini values taken from iReiners et all ()2010l ) (see 
TableQlfor the exact values). This can be considered the 
perfect observation, with the only degradation of spectral 
quality set by the spectrometer resolution and rotational 



broadening. To consider the effects of finite observing 
time, we then inject Poisson noise at a level equivalent 
to one minute of observing time, and Gaussian read noise 
at the level of 5 electrons/pixel spread over 10 pixels in 
the cross dispersion direction 0. Note that fainter stars 
will have a lower S/N ratio, as will wavelength bands 
away from the peak of the spectral energy distribution. 

Determining a radial velocity is equivalent to trying to 
recover the shift in a spectrum with respect to a reference 
spectrum, according to AA/A — v/c. Algorithms to do 
this include cross-correlation maximization, least-squares 
minimization, forward modeling, and various simplex al- 
gorithms. For our simulated observations, the degrada- 
tion of the spectrum with noise will introduce a spurious 
shift with respect to the convolved, non-noisy spectrum, 
which we recover using an algorithm where we fit the 
peak of the cross-correlation function with a polynomial. 
We checked that this converges to zero velocity shift lin- 
early the signal-to-noise ratio approaches infinity, as ex- 
pected. This precision is what would result from having 
a perfect reduction pipeline, regardless of the specifics of 
the calibration method. 

As a measure of our velocity precision, we repeat this 
simulated observation three hundred times, and take the 
standard deviation of the radial velocities as a measure 
of the velocity precision, {ay ) . In order to assess the best 
wavelengths of interest and target spectral classes, we 
repeat this simulation for wavelength regions of size 100 
nm and stellar effective temperatures from 2600-6200 K 
at steps of 200 K, from 0.3-2.5 ^m, with an assumed 
distance to the star of 10 pc. The results are shown in 
Figure [H 

The main conclusion of this numerical experiment is 
that in terms of best achievable radial velocity precision 
in a fixed amount of observing time for a star at a fixed 
distance, there is little advantage to moving redward of 
600 nm, and the best overall area to observe for a large 
range of spectral classes is the region between about 400 
and 600 nm. This result is based on the convergence 
of a few physical effects. First, the spectral energy dis- 
tribution peaks in this wavelength range for stars in the 
temperature range 4750-6000 K. This gives an advantage 
in terms of signal-to-noise. Second, high spectral qual- 
ity occurs at these wavelengths, caused by deep atomic 
absorption features from metalsO While the cooler stars 
do share some of these features, their spectral energy 
distributions peak at redder wavelengths. This is evi- 
dent in the contours moving rightward in the 2600-4250 
K, 600-800 nm area. Additionally, the overall decrease 
in intensity of the stellar spectra at longer wavelengths 
leads to a general decrease in precision from left to right. 
Furthermore, the deep and rich molecular lines present 
in cooler stars partially offset the fact that the stars are 
smaller and dimmer, leading to bands of higher radial 
velocity precision from 1.4-1.6 /xm and 2.3-2.6 /xm, but 
the absolute level is still well below the precision from 
400-600 nm. Any increase in the exposure time will only 

* for the typical (high) signal-to-noise levels in precision radial 
velocity observations, read noise does not contribute significantly 
to the error budget, a fact that we verified here 

^ Coincidentally, this area happens to overlap the absorption 
lines of molecular iodine, a commonly used wavelength calibration 
source, which helps to explain the success of iodine cells in planet 
hunting 
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Fig. 1. — Doppler precision as a function of wavelength range and star temperature for a fixed amount of observing time (60 s). The 
stellar spectra are derived from rotationally-broadened main-sequence templates from 2600-6200K, stepped in 200K increments, and the 
wavelength range is stepped in 100 nm increments. The contours indicate the velocity precision in m/s. From the perspective of velocity 
precision, the best result is achieved in the range of 400-600 nm. The hashed regions correspond to wavelengths where the infrared 
absorption is too high for ground-based observations to be effective. This simulation assumes a 1.28 telescope dish, a spectrograph with 
-R=75000, and sky-to-detector throughput of 10% (The full simulation parameters, including stellar parameters, are given in the Appendix). 
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change the absolute scahng of the velocity precision; it 
will not affect the relative precision between different 
spectral types and wavelength bands. 

There are a number of caveats to this analysis. In 
the infrared bandpass considered (longer than ~650 nm), 
there are are significant absorption features by water and 
oxygen in the Earth's atmosphere, as well as OH emission 
lines, which make ground-based studies more challenging 
or even impossible. We have shaded the areas where in- 
frared absorption is too high for effective observing from 
the ground. For the other regions, this simulation as- 
sumes that these features can be modeled and subtracted 
effectively and have no effect on the velocity precision. At 
high resolution, it is possible to identify telluric features 
and remove them, and recent results suggest that at- 
mospheric calibration can subtract these lines effectively 
(stable to 10 m/s over 6 year timescales), and can even 
use these features as stabl e wavelength referen ces to 2 
m/s over short timescales (jFigueira et al.ll20To() . A no- 
table recent study that includes the effects of iinperfec t 
telluric subtraction can be found in lWang fc Gi Mui) . 
Even so, in our result, the photon errors are a few times 
higher, and dominate the error budget, despite the as- 
sumption of perfect sky subtraction. 

Another important point, not obvious from the plot, is 
that the high average rotation rate (~ 9 km/s) of late M- 
dwarfs imposes a severe penalty on the velocity precision 
obtainable on these stars, as the lines are significantly 
blended at these speeds. Repeating this simulation with 
a fixed rotation rate of 2 km/s for these stars leads to 
about a factor of two higher in precision for the same 
amount of observing time. This is a major penalty, as 
a photon limited observation takes four times as long to 
get twice the velocity precision. Furthermore, the read 
noise is a proportionally larger part of the total signal at 
these lower photon counts. 

The fixed integration time can reasonably be consid- 
ered too severe a restriction, particularly for the cooler 
stars and at longer wavelengths, since similarly sized 
planet can cause a much larger reflex velocity on a smaller 
star. Furthermore, the simulation considered different 
types of stars at a flxed distance from the Earth, which 
ignores the realities of the present day mass function. 
These are both fair points, and we consider the latter 
effect of detectability of planets in the next section. 

3. MAXIMIZING RADIAL VELOCITY SURVEY 
YIELDS 

While it is clear that hotter stars at a fixed distance 
away are more amenable to high velocity precision, the 
goal of most radial velocity surveys is not solely high 
precision, but planet detection. Target selection is im- 
portant. Therefore, there are two major revisions to the 
above analysis. The first is that planets of a given mass 
and orbital distance will cause a larger reflex velocity in 
lower mass stars. This means that less observing time 
will be needed to detect that planet, making lower mass 
stars more attractive targets, as their relative faintness is 
somewhat mitigated. Equally important is the distribu- 
tion of stellar masses as a function of distance from the 
Sun — ie, the probability that a star exists a certain dis- 
tance from the Sun depends on how massive it is. Closer 
stars make better targets because because of the higher 
incident flux, though less massive stars have lower lumi- 



nosities. 

We first quantify the above statements. The refiex 
velocity on a star caused by an orbiting planet is given 
bj0 



5.9 cm/s X — — 



-1/2 



lAU 



(1) 



where Mpi is the mass of the planet. A/* is the mass of the 
star, and a is the orbital distance of the planet (we con- 
sider only circular orbits for simplicity. Also, Mpi should 
always be considered as Mpi sin i to take into account 
inclination). 

Secondly, we consider the amount of observing time 
per night it takes to detect a hypothetical planet which 
causes a given refiex velocity in its parent star. We 
consider a "detection" to be equivalent to a measure- 
ment with velocity precision equal to the reflex veloc- 
ity of the star. A simpler way of saying this is that in 
the limit of many observations evenly spaced out over 
the planetary orbital phase, a single-point precision of 
1 m/s is sufficient to detect a planet causing a 1 m/s 
modulation of its parent star's velocity. We confirmed 
this using white-noise simulations of planet-induced stel- 
lar RV variations detected using both the Lomb-Scargle 
periodogram(Scargic 1982) and a test to reject the 
null hypothesis of constant stellar RV. 

We repeat the calculation of Figure [T] for 10 minute's 
worth of observing time, for a star at 10 pc, from Figure 
[TJ For the more general case, we have a scaling: 



t = 600 s X 



l^lOmin.lOpc 



v^{Mpi,M^,api) 10 pc 



(2) 



where is the reflex velocity of the star; a function 
of the planet mass, stellar mass, and orbital distance, 
given in the previous equation. This expression can be 
derived in two steps. First, recall that the velocity pre- 
cision scales as the inverse of the signal-to-noise ratio, 
which is proportional to the square of the observation 
duration. Of course, this scaling relation only applies in 
regimes where read noise is negligible, but this is true 
in the cases under consideration, where typical observ- 
ing times are many minutes. Second, the signal-to-noise 
is equal to the square root of the number of photons, 
wh ich s cales as the inverse of the distance to the star 
[yjNph oc d~^). Putting these two together gives the 
above expression, which is exact within our framework. 
This last result is interesting in of itself, because it is the 
inverse of planets per unit observing time, which we can 
evaluate as a function of wavelength of observation and 
stellar type. 

Equation 2 depends on the variables Af, , Mpi , api , 
which determine ; on AAo&s and Tog, the bandwidth 
of observation and stellar effective temperature, which 
determine the velocity precision in ten minutes for a 
star at 10 pc; as well as the distance to the star, d. 
To simplify things and remove one parameter, we con- 

This expression can be derived quickly from conservation of 
momentum (M«i;« = MpiVpi) and Kepler's second law (P^ oc a^), 
then scaling to the reflex velocity of Sun caused by the Earth, 8.9 
cm/s. 
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vert freely between stellar mass an d effective tempera- 
ture using the BCAH 98 isochrones (jBaraffe et al.lll998D 
at 2 Gyrs ([Fe/H]=0), which are reasonably accurate for 
physical properties of low-mass stars. In this Tcff range, 
stellar properties are not very sensitive to evolution for 
the first few Gyrs on the main sequence. 

In principle, all that is needed now is a complete list of 
stars of known spectral type and distance from the Sun 
(this gives d and in the above expression), and the 
planet mass/orbital distance distribution as a function 
of spectral type. Then we can calculate the amount of 
time needed to detect a putative planet, as a function of 
observing wavelength range. We can then order the list 
in terms of observing time, which will show what wave- 
length range and spectral type maximizes the number of 
planet detections. 

Unfortunately, the planet mass/orbital distance dis- 
tribution function is not known, and there is hardly a 
complete list of stars in the galaxy with known distances 
and spectral types. Despite this, we can construct a rea- 
sonably accurate stellar census using the RECONS "100 
nearest stars" sample (www.recons.org), which is com- 
plete out to ~7 parsecs. We populate the first 7 parsecs 
of our sample from RECONS. In extrapolating outwards, 
we assume a constant stellar number density per unit vol- 
ume and proceeding outward in spherical shells, drawing 
from an estimate of the present day stellar mass function 
(jReid et al.ll2002t ). As a check of this method, we found 
that this reasonably reproduces the number of stars of 
each spectral type in the RECONS 10 pc sample^, with 
the deviation of our results at the few percent level, con- 
sistent with the statistical variability of the stellar neigh- 
borhood. 

We do not attempt to guess the planet mass/orbital 
distance distribution as a function of spectral type; after 
all, discovering this is one of the goals of planet surveys. 
However, we can apply our analysis to cases where we can 
make reasonable assumptions. First, we consider surveys 
that are complete down to a certain planetary mass and 
orbital period. Second, we focus on searching for planets 
in stellar habitable zones. 

3.1. Surveys complete to limits in planet mass and 
orbital period 

Since RV surveys detect stellar accelerations, for a 
given stellar mass, the quantity Mpia~^/'^ determines the 
necessary velocity precision. We can set this to a con- 
stant and then determine the necessary time to reach this 
precision as a function of stellar mass and distance: 



i = 600 s X 



1'lOmin,10pc X 



(8.9 cm/s) (Mp/a-1/2 = const) 10 pc 



(3) 

A plot of this result with the distance fixed at 10 pc 
and ^ SM^AU"^/^ is shown in Figure [H The 

higher mass stars generally take less time in terms of 
planet detection, and the region of 400-600 nm is still 
the best for a range of stars. 

The full RECONS 10 pc sample has not been released as of the 
submission of this paper, though they have released the number of 
stars of each spectral class in the 10 pc sample. 



To include the effects of real stellar populations, we 
simulate performing such a survey. We apply the results 
of Figure[2]to our simulated sample of stars out to 20 pc. 
For each star, we calculate the amount of observing time 
necessary to get a radial velocity precision lower than the 
reflex velocity caused by the planet. 

In order to maximize the survey yield, we order these 
times in increasing order and select the stars until we 
exceed the amount of observing time per night (9 hrs, 
with 2 minutes of acquisition per target). We plot this 
result in Figure |3] as a function of wavelength range and 
spectral type (for our conversions from mass to spectral 
type, see the appendix). 

The results in Figure |3] show that the best place to 
observe is around 400-500 nm, with the primary targets 
being F, G, and K stars. Redward of 600 nm, early M 
dwarfs become the primary targets, but the number of 
stars that can be observed from night to night is lower 
by a factor of two to four. Repeating this experiment for 
a distance out to 300 pc gives roughly the same results, 
as nearby stars are the most efficient, though there is a 
slight bias towards hotter stars in the visible wavelengths, 
because their luminosities make them observable over a 
larger volume. 

The results here do not consider the effects of stellar 
activity, but it should be pointed out that F stars are 
known to be quite jittery and don't make ideal targets. 
Regardless, they make up a relatively small percentage 
of the total stars, so the results do not change very much. 

3.2. Habitable zone planet surveys 

Next, we consider the case of planets in the habitable 
zones of their parent stars. Here we can avoid the ques- 
tion of planet distribution as a function of orbital dis- 
tance and spectral type. For a star of a given Tcff, we 
can calculate the inner and outer habitable zone, where 
we define the habitable zone to be the region where the 
equilib rium temperature of the planet is between 175 and 
275 K ()Kaltenegger fc Sasselovl(20Tl[) : 



(1 - A)Rl 



-,1/4 



4^a2(l 



(4) 



where we set the albedo, A = 0.5, the eccentricity e = 0, 
and the planetary re-radiation fraction /3 to 1 for non- 
tidally locke d planets and 0.5 for tidally locked planets 
(|Peale|[l~977[ ). We set the radius to be a function of the 
effective temperature of the star, a nd use values gcncr- 
ated from the BCAH 98 isochrones (|Baraffe et al.lll99&) . 
(For main sequence stars in this Tefi range, the phys- 
ical properties of the stars (mass, radius, etc) are not 
greatly affected by stellar evolution over a reasonably 
long timescale, so we simply use the values from the 2 
Gyr isochrones.) With these assumptions in place, we 
can solve for the two values of a, the inner and outer 
habitable zones. We then calculate the reflex velocity of 
the stars caused by these planets, given by 



V* = 8.9 cm/s X — ^ — — 



-1/2 



lAU 



(5) 



once again using the BCAH 98 isochrones to derive a 
consistent mass estimate for the star from the effective 
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Fig. 2. — The time (seconds) to detect (av = K) a planet with Mpia^^^^ = SMgj (1AU)~^/^ , 10 parsecs away, for a range of observing 
wavelengths and stellar effective temperatures. The hashed regions correspond to wavelengths where the infrared absorption is too high 
for ground-based observations to be effective. This simulation assumes a 1.28 m? telescope dish, a spectrograph with /?=75000, and 
sky-to-detector throughput of 10% (The full simulation parameters, including stellar parameters, are given in the Appendix). 
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Fig. 3. — The maximum number of observable stars each night, with the goal of achieving the velocity precision necessary for a detection 
of a planet with Mpja^^^ = SMgjAU^'''^ on each star. This is plotted as a function of observing wavelengths, assuming 9 hours of observing 
time per night and 2 minute acquisit ion time between targets. The survey was simulated using the RECONS 7 pc sample and the present- 
day mass function IReid et al.ll2002) . and extends to 20 pc (going out to 300 pc makes little difference, as bright and nearby stars are the 
most time-efficient targets). 
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temperatures, and placing the planet at the edge of the 
inner HZ. 

We repeat the calculation of the observing time neces- 
sary to detect a 5 planet in the center of the habitable 
zone of its parent star, given by 



t = 600 s X 



'^lOmin.lOpc 



v*{5Mg),M^,aHz) 10 pc 



(6) 



This result gives a notion of habitable planets per unit 
observing time, even if it depends on the distance to the 
star. A plot of this result, for a fixed distance of 10 
parsecs is shown in Figure |4l 

Here, the M-dwarfs easily make the most attractive 
targets, as it takes much less observing time to recover a 
habitable zone planets around them. Furthermore, this 
result ignored systematic noise floors, which will make 
some of the brighter stars problematic as targets for hab- 
itable zone planets, as their reflex velocities can be below 
the noise floor. 

Again, for the simulated stellar neighborhood of 20 pc, 
we calculate the observing time for each star necessary to 
detect a habitable zone planet according to the results of 
Figure HI We assume the same 9 hours of observing time 
per night at 2 minute acquisition time between targets, 
and plot the results in Figure [5l Here, there are dra- 
matically different results compared to the previous case. 
Because the habitable zone orbital distance decreases so 
rapidly for lower mass stars, the corresponding radial 
velocity signal is much larger for the same mass planet. 
This advantage is so pronounced that late M dwarfs be- 
come the primary targets for surveys operating at essen- 
tially all wavelength ranges except the bluest. The best 
wavelength to observe now becomes 700-800 nm, but this 
is somewhat constant for the range of 400-800 nm. Note 
that the absolute number of targets are somewhat higher 
for the habitable zone survey. This is an effect of the ob- 
serving strategy; the observing time for each target is 
set by the required velocity precision, not the ultimate 
velocity precision of the instrument. If one can detect a 
habitable-zone planet at 5 m/s in 1 minute, getting twice 
the velocity precision in four times the observing time is 
not worth it, even if the spectrometer is able to reach the 
precision easily. Figure 5 may give the impression that 
there is little point to moving to the near-infrared, but 
this is somewhat an artifact of the choice of a 5 planet 
as the target and the fact that systematic noise floors are 
not considered yet. For a lower mass planets and more 
massive stars, the radial velocities of the stars are so 
small that they would almost certainly be below the sys- 
tematic noise floor of the instrument, meaning that the 
most promising targets would be in the near infrared, 
where the stellar radial velocities would be higher. See 
Section 4.3 and Figure 9 for further discussion of these 
effects. 

As previously mentioned, we did not consider the 
"contamination" of stellar spectra by telluric lines in 
the Earth's atmosphere, effects which are wavelength- 
dependent. In the case of the ideal survey complete to 
a mass-orbital distance limit, telluric lines are not rele- 
vant, because observations will be taking place in the vis- 
ible wavelengths at 400-600 nm, where the atmosphere 
is mainly transparent. For the habitable-zone survey. 



they will certainly be present in many of the infrared 
regions. However, many telluric lines are stable at the 
the few m/s level, which is often lower than the reflex 
velocities of the low mass stars, the primary targets of 
the survey. This means telluric lines could be used as 
broadband wavelength references. Due to the potential 
of M-dwarf infrared surveys, efforts put towards improv- 
ing t elluric referencing an d calibration are highly impor- 
tant (IBlake fc Shaw|[20ll . 

Notably, we did not consider stellar jitter caused by os- 
cillations, granulation, or activity such as spots. For the 
interested reader, a t horough exploration of th ese sub- 
jects can be found in iDumusaue et al.l (|2011bl lal) where 
these effects are analyzed in detail for solar-type stars, 
and observing strategies/corrections to mitigate different 
kinds of jitter are explored. Of these three effects, the lat- 
ter is considered the most troublesome in terms of planet 
detection, as the characteristic timescales of spot-related 
jitter are similar to planetary periods. The simulations in 
this paper deal primarily with statistical errors and their 
dependence on observing wavelengths and stellar effec- 
tive temperatures; the effects of stellar jitter are cate- 
gorically different, as they are real radial velocity signals 
with the potential to confuse the actual planetary sig- 
nal. Furthermore, the spectral-type dependence of spot 
number, size, and shape are not well constrained, making 
meaningful simulation of these effects difficult within our 
framework. It is possible that spot jitter would skew the 
results above; for example, if one spectral type typically 
has spot distributions that are extremely stable and sim- 
ilar to planetary signals, whereas another has spots that 
are easily distinguishable from planetary signals, then 
the latter would be preferable to the former. Of course, 
this would be dependent on the ability of the data re- 
duction and observing strategy to distinguish spot jitter 
from true planetary signals, whereas the results above 
show more funda mental limitations . With this in mind , 
recent advances (jLanza et al.l 120111 : lAigrain et al.l I2012D 
have shown a promising ability to subtract out spot jitter 
with the combination of high-precision photometry. Per- 
haps most encouragingly, by modeling stellar effects, Du- 
musque and collaborators were able to discover an Earth 
mass planet around a Centauri B, a star with a "stellar 
noise" level many times higher than the planetary signal 
pumusaue et ai.ll2012D . 

4. INSTRUMENTAL EFFECTS AND SYSTEMATIC 
NOISE FLOORS 

The most unrealistic assumption so far is the assump- 
tion of a perfect instrument in our hypothetical survey, 
as all past, present, and future instruments have a limit- 
ing precision. In practice, this means that after a certain 
point, increasing the exposure time does not lead to an 
increase in velocity precision. 

Up to this point, we have assumed that we have a 
particular instrumental configuration; a resolution 75000 
spectrometer with 3 pixel sampling and a velocity pre- 
cision limited by the signal-to-noise ratio and read noise 
(which is negligible for most cases). We first consider 
our choices of resolving power and sampling, and how 
they contribute to the velocity precision. Furthermore, 
we consider the effects of instrumental instabilities and 
show how they can lead to systematic noise floors. Fi- 
nally, we assess the effects of these noise floors on our hy- 



8 




500 1000 1500 2000 2500 

Wavelength(nm) 

Fig. 4. — The time (seconds) to detect (c^ = K) a 5 Mjg planet in the habitable zone of its parent star, 10 parsecs away, for a range of 
observing wavelengths and stellar effective temperatures. The hashed regions correspond to wavelengths where the infrared absorption is 
too high for ground-based observations to be effective. This simulation assumes a 1.28 m^ telescope dish, a spectrograph with i?= 75000, 
and sky-to-detector throughput of 10% (The full simulation parameters, including stellar parameters, are given in the Appendix). 




Fig. 5. — The maximum number of observable stars each night, with the goal of achieving the velocity precision necessary for a detection 
of a 5 Mg5 habitable-zone planet on each star. This is plotted as a function of observing wavelengths, assuming 9 hours of observing time 
per night and 2 m i nute acquisition time between targets. The survey was simulated using the RECONS 7 pc sample and the present-day 
mass function llReid et al.ll2002f) . and extends to 20 pc (going out to 300 pc makes little difference, as bright and nearby stars are the most 
time-efhcient targets). 
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pothetical survey, and show how they can substantially 
affect the optimal target selection. 

4.1. Effects of Spectrometer Resolution and Sampling 

The choice of resolving power of a spectrograph is im- 
portant, as more sharply resolved lines lead to higher 
Doppler precision. A simple analytic calculation for one 
line suggests that the accuracy should scale as cr^ oc R~^, 
which was approximately reproduced in an early study 
(|Hatzes fc Cochranlll992f ). 

We examine the effect of spectrometer resolution on 
velocity precision at a fixed exposure time on a Sun-like 
(Teff=5800 K, log g = 4.5 and [Fe/H]=0) star. We sim- 
ulate one thousand measurements at resolving powers 
ranging from 10,000 to 150,000, in the wavelength range 
of 500-600 nm, and as before, take the standard deviation 
of the velocity measurements as our velocity precision. 
The results are shown in Figure [6l 




Resolution, A/AA 

Fig. 6. — The resulting velocity precision and signal-to-noise ra- 
tio for an observation of a Sun-like star with varying resolution, 
(3.0 samples/resolution element). Note that the units on the verti- 
cal axis are m/s for the blue curve (velocity precision), and unitless 
for the red curve (signal- to- noise ratio) . The exposure time is held 
fixed, and the resulting signal-to-noise ratio decreases as resolution 
increases, since less photons are incident per pixel. Increasing the 
resolution always improves the velocity precision, but the point of 
diminishing returns is reached at about R = 45,000, which corre- 
sponds to the point where almost all the spectral lines are fully 
resolved. 

Our results show that the best-fitting power law is CTu oc 
R""'^'^, and that diminishing returns appear at about 
R=45000, where the spectral lines are resolved. It should 
be mentioned that the power in the proportionality is 
a function of the spectral quality (for example, the line 
density), but we were unable to find much deviation from 
the value of 1.2 in interesting spectral regions. We were 
not able to find any region where the power was as high 
as 1.5. Additionally, although precision improves with 
higher resolution, resolving powers above 100,000 yield 
little additional benefit, as essentially all the lines are 
resolved. 

Satisfying the Nyquist sampling theorem requires that 
at least 2 pixels cover each resolution element, but more 
pixels may be used. Increasing the sampling turns out 
to have no effect on velocity precision, assuming the ex- 
posure time stays constant. The reason for this may be 
seen in that the number of photons per pixel is reduced 
by a factor of N, where N is the number of pixels per 
resolution element; and the number of pixels in the data 



product is increased by the same factor. Since the ra- 
dial velocity precision is inversely proportional to the to 
signal-to-noise ratio (oc N^^"^) and to the square root of 
the bandwidth, these effects cancel out. We verified this 
(non)effect with numerical simulations. We point out 
that in the limit of extreme (cm/s) velocity precision, it 
is not safe to ignore pixel topology effects, as the pixels 
may vary in efficiency over the center to the edge. This 
can become a problem when the line-spread function is 
minimally sampled. 

Taken together, these two results show that our choice 
of resolution 75000 and sampling of 3 pixels in our sim- 
ulated instrument was reasonable, and did not affect the 
recovery of radial velocities adversely. 



4.2. Effects of incompletely recovered instrumental 
profiles 

In general, the measured output from a spectrograph 
is the intrinsic spectrum of the object convolved with the 
instrumental profile (IP) or line spread function (LSF) of 
the instrument: 



to(A) 



s(A')IP(A - \')d\' 



In this equation, m(A) is the measured spectrum, IP (A) 
is the instrumental profile (which is normalized to unity 
by conservation of flux), and s(A) is the "true" stellar 
spectrum. The instrumental profile is fiducially a gaus- 
sian with a fuU-width-half-max equal to the resolution of 
the spectrograph. 

A typical extraction of a radial velocity datapoint in- 
volves consideration of the entire spectral region. First, 
the instrumental profile is extracted from a wavelength 
reference source, and after the observation, this instru- 
mental profile is deconvolved from the stellar spectrum. 
In cases of simultaneous calibration (as with an iodine 
cell), the full transmission spectrum is modeled. Finally, 
the radial velocity datapoint is extracted from the shift 
in the spectrum with respect to the wavelength solution. 

Properly characterizing the IP of the spectrometer is 
a challenging task, b ut is essential to r ecovering radial 
velocities accurately (jValenti et al.lll995l) ), especially as 
all further steps depend on its characterization. It is 
a function of the optical path, and hence depends on 
environmental parameters like temperature and pressure, 
as well as slit illumination and focus. These parameters 
can change during and between observations, and thus 
the IP must be recalculated for each observation; that 
is, the IP varies with time. Furthermore, since optical 
elements have wavelength-dependent properties, the IP 
is wavelength-dependent. This means that in practice 
it is necessary to model an IP varying with wavelength, 
rather than a constant one. It is clear that any change 
in the IP that can be accounted for and modeled is not 
relevant. However, any changes not accounted for will be 
interpreted as radial velocity shifts-this is due to having 
two different IPs; the physical instrument profile, and the 
approximation that is deconvolved from the observation. 

To model this, we convolve our model spectrum with a 
perturbed IP. While it is obviously not feasible to exam- 
ine every possible perturbation, we can derive useful rules 
of thumb from characterizing simple cases. We begin by 
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restricting ourselves to a gaussian LSF with equivalent 
resolution of 75000 and 3.0 samples/resolution element 
as our ideal IP. We simulate the effects of LSF mismatch 
by convolving with a stellar spectrum (Teff=5800 K, log 
g=4.5 [Fe/H]= 0) with the perturbed spread function 
and trying to recover the velocity shift (which should be 
zero) with respect to the spectrum convolved with the 
ideal IP. As before, we repeat this many times and take 
the standard deviation of our derived velocities as our 
velocity precision. 

In the first case, we consider consider a gaussian LSF 
with some skew added. Skewness (73) is measure of 
asymmetry of the distribution, and is a property that 
can be straightforwardly calculated for a particular in- 
strumental profile. Physically, skewness in the IP results 
when a source moves perpendicular to the slit direction 

Mathematically, the skewness of a function is given by: 



73 = E 



X 



(7) 



This equation is general and applies to any distribution. 
For our purposes, we consider the skew-normal distribu- 
tion with parameter a, given by the function 



f{x) = 2(i)(x)^{ax) 



where 



<j,{x) 



/2tt 



(8) 



(i){t)dt (9) 



Note that the skew parameter a is not actually equal to 
the skewness of the distribution, which is a complicated 
function of a: 
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l73|^ + ((4-7r)/2)^ 



(10) 



where the skewness (73) is recovered by inverting the 
equation above for a. Despite the rather opaque equa- 
tion above, the usefulness of this parametrization is that 
it is, in a sense, a "simple" way of adding skewness in 
a distribution, and for a = one recovers the normal 
distribution, which is our fiducial way of representing an 
instrumental profile. 

It is apparent from Figure [7] how a velocity floor can 
arise from an uncorrected skewed IP. For example, at a 
skewness of 0.05, it will be impossible to do better than 
5 m/s in precision, regardless of the signal-to-noise ratio. 

In the second case, we consider an IP with a small gaus- 
sian perturbation with varying amplitude. The choice 
of this form of perturbation is due to the fact that a 
common practical way to represent an imperfect IP is 
through many small gaussian functions added together 
on top o f the main g au s sian IP (Bu tler & Marcv 1996; 
lEndl et ali [2000i: fKaml)e"eral.i [2001 . The position of 
the perturbation is set to vary normally with a standard 
deviation equal to the standard deviation of the ideal IP 
(1). Also, the width of the perturbation is fixed to be 



A real 
Herschel 



e xample of 



thi s effe ct 
I20T0D 



can be found in 
observer's manual. 



the Herschel IPilbratt et al.l 

http: //herschel. esac, esa.int/Docs/PACS/html/ch04s07.html. 
particularly figure 4-17. 




Fig. 7. — The resulting velocity precision for observation of a Sun- 
like star with R= 75000 (3.0 samples/resolution element), and skew- 
ness varying from 10~^ to 10~^. For reference, the inset shows a 
gaussian with skewness (not a) of 0.3, an order of magnitude higher 
than the maximum value considered (none of the skew-normal dis- 
tributions simulated have skews large enough to be visually distinct 
from a normal distribution.) The skewness sets a signal- to- noise 
floor when it is greater than a part in 100, weakly dependent on 
signal to noise. The flattening out of the curves occurs where the 
signal-to-noise ratio limits the velocity precision. 




0.01 0.10 1.00 10.00 

I0 IP amplitude perturbation (% of IP peak) 

Fig. 8. — The resulting velocity precision for observation of a 
Sun-like star with R=75000 (3.0 samples/resolution element), and 
perturbation amplitude varying from 10"* to 10~^ of the peak 
amplitude of the LSF. For comparison, the inset shows a gaussian 
with a perturbation of 10%, equal to the maximum value consid- 
ered. It is clear that for sub meter/sec precision, it is important 
that the perturbation amplitude of the distribution does not ex- 
ceed 0.1 %, a value weakly dependent on the signal-to-noise ratio. 
The flattening out of the curves occurs where the signal-to-noise 
ratio limits the velocity precision. 



of a characteristic size of one of the mini-gaussians used 
to model the IP. We examine the effects of the perturba- 
tion amplitude on the velocity precision under the same 
conditions as the previous test. 

The result of this simulation, in Figure El reinforces 
how little tolerance there is in terms of characterizing the 
instrumental profile. A perturbation as small as 3 % IP 
peak is able to set a velocity fioor of 5 m/s, independent 
of the signal-to-noise ratio. 

4.3. Effects of noise floors on survey yields 

The preceding section demonstrates the need of a stiff 
combination of stabilizing the IP through temperature, 
pressure, and illumination control and immediately cap- 
turing any changes that occur. However, it is reasonable 
to assume that there will be some velocity floor in ev- 
ery survey. We examined the effect by putting arbitrary 
noise floors at different velocities. We repeated our sim- 
ulated habitable-zone survey, except removed stars from 
the target list if their predicted planetary signal was be- 
low the noise floor (for the other survey, the very ex- 
istence of noise floors negates its completeness). This 
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makes massive stars become inaccessible targets, and 
they are progressively replaced by the next less massive 
stars as the noise floor increases, and observations go to 
the limiting precision. There are less targets per night as 
well, though the decrease is not as severe as one would 
think, as removing stars frees up available observing time 
for other targets within the detection limits: G stars re- 
place F stars, K stars replace G stars, and so on. 

We simulated this effects for noise floors of 0.5, 1, 
3, and 5 m/s. The results are shown in Figure [9l 
which demonstrates that mid and late-M dwarfs be- 
come better targets as precision decreases, and that the 
number of stars observable per night does not decrease 
substantially-after about 800 nm, the amount of observ- 
able stars in a given night is set primarily by the duty 
cycle. Additionally, the wavelength range of 700-800 nm 
is the best overall in terms of total number of targets. 

5. CONCLUSION/DISCUSSION 

We have investigated the design requirements and per- 
formance expectations for radial velocity surveys, deriv- 
ing the best wavelength bands and targets for different 
survey goals. For surveys targeting completeness out to 
a particular planetary mass-orbital distance product, the 
best targets are F, G, and K dwarfs observed at wave- 
lengths spanning 400-600 nm. For surveys of habitable- 
zone planets, the best targets are late M dwarfs in the 
wavelength range of 700-800 nm, though the number of 
possible targets stays flat from 400-800 nm. Of the two 
survey methods, those searching for habitable-zone plan- 
ets are more productive, as the larger expected radial- 
velocity signals lead to more targets, and hence more 
detections. These results are based on the considera- 
tion of a number of input parameters, within the frame- 
work of maximizing planet detections in a fixed amount 
of observing time per night. In particular, we considered 
how the spectral quality, stellar flux, photon noise, stel- 
lar and planetary mass, and stellar mass distribution in 
the galaxy play against each other to produce different 
ideal targets. 

For surveys that are complete to a constant Mpja"^/^, 
brightness of targets is paramount: the best targets are 
hotter stars, and the visible wavelengths where these 
stars' spectral energy distributions peak, coincident with 
where the density of absorption lines is highest. The 
target selection result holds even when considering the 
present-day stellar mass function. Generally, the observ- 
ing strategy consistent with this analysis is to target as 
many bright, non-jittery stars as possible out to a limit- 
ing magnitude (or volume), then move to lower masses. 

In the case of habitable zone planet surveys, our re- 
sults demonstrate the potential of infrared surveys of M 
dwarfs. For an observing program that has relatively 
modest velocity precision, it is possible to have a survey 
of M dwarfs that is complete within the limits of the hab- 
itable zone. Note that this requires a different sort of ob- 
serving strategy than one where each observation is taken 
to the noise floor of the velocity precision. This would 
make sense for F, G, and K stars, as the reflex velocities 
for habitable-zone candidates are very low. However, for 
M-dwarfs, this is inefficient, as habitable-zone velocity 
precisions are higher than systematic instrumental limi- 
tations. Since the velocity precision scales approximately 
as the square of the observing time, getting excessive 



precision will dominate the nightly time budget. This 
directly penalizes the number of target stars in the limit 
of complete phase coverage. A better choice is to observe 
only until the target precision is reached, and then move 
on to the next target. 

We demonstrated how instrumental imperfections can 
lead to systematic noise floors, gave quantitative pre- 
scriptions for the level of stability needed in a restricted 
number of cases, and explored how these effects would 
change the scope of a habitable-zone planet survey. Sig- 
nificantly, the number of potential targets is highly de- 
pendent on the noise floor for shorter wavelengths, but 
is basically unchanged for longer wavelengths. The main 
change is that the target stars become later and later, 
demonstrating the high potential of infrared surveys. 
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der) Prof. Lynne Hillenbrand (CIT), for providing guid- 
ance and expertise; Dr. Sasha Hinkley and Sebastian 
Pineda (CIT), for helpful critique and discussion; Dr. 
Peter Plavchan (IPAC), for detailed comments and per- 
spective on early drafts of the paper. We especially thank 
the anonymous referee, whose suggestions improved the 
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Fig. 9. — The maximum number of observable stars as a function of observing wavelengths, assuming 9 hours of observing time per 
night and 2 minute acquisi tion time between targets. The survey was simulated using the RECONS 7 pc sample and the present-day mass 
function d Reid et al.||2003) . and extends to 20 pc (going out to 300 pc makes little difference, as bright and nearby stars are the most 
time- efficient targets). These graph assumes a survey targeting habitable zone planets, with velocity precisions limited to 0.5 (top), 1, 3, 
and 5 m/s. Furthermore, the region of 0.7-0.8 /i. m is the best area to observe overall. 
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APPENDIX 

Simulation Parameters 

The following table gives the parameters in our simulation of velocity precision in Figure [T] It is stated in the text 
when any of these parameters are changed. 



Simulation parameters 


Property 


Default Value 


Unit 


Stellar atmosphere models 


PHOENIX 


BT-Settl 2009 




[Fe/H] 
log g 


0.0 




2600 K < Teff < 3400 K 


5.0 




3600 K< Teff < 5800 K 


4.5 




5800 K < Teff 


4.0 




a-enhancement 


0.0 


Stellar isochrone models 


BCAH (1998) 


2 Gyr 


Stellar rotation rate, usini 






2600 K< Teff < 2800 K 


9.0 


km/s 


2800 K< Teff < 3200 K 


6.0 


km/s 


3200 K< Teff < 3800 K 


3.0 


km/s 


3800 K< Teff 


2.0 


km/s 


Distance 


10 


pc 


Telescope area 


1.28 


m^ 


Observing time 


60 


s 


Spectrograph resolution 


75000 




Spectrograph sampling 


3.0 


pixels per resolution element 


Throughput (sky to detector) 


10 


% 


Read noise 


5 


electrons 


Cross-dispersion 


10 


pixels 



Spectral class conversions 



Spectral Class 


Mass (Mq) 


Late M 


0.08-0.23 


Early M 


0.23-0.51 


K 


0.51-0.79 


G 


0.79-1.05 


F 


1.05-1.6 
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